Consensus clustering applied to multi-omics disease subtyping
نویسندگان
چکیده
Abstract Background Facing the diversity of omics data and difficulty selecting one result over all those produced by several methods, consensus strategies have potential to reconcile multiple inputs produce robust results. Results Here, we introduce ClustOmics, a generic clustering tool that use in context cancer subtyping. ClustOmics relies on non-relational graph database, which allows for simultaneous integration both results from various methods. This new conciliates input clusterings, regardless their origin, number, size or shape. implements an intuitive flexible strategy, based upon idea evidence accumulation . computes co-occurrences pairs samples clusters uses this score as similarity measure reorganize into clusters. Conclusion We applied multi-omics disease subtyping real TCGA ten different types. showed is heterogeneous qualities partitions, smoothing reconciling preliminary predictions high-quality clusters, computational biological point view. The comparison state-of-the-art consensus-based tool, COCA, further corroborated statement. However, main interest not compete with other tools, but rather make profit when no gold-standard metric available assess significance. Availability source code, released under MIT license, obtained are GitHub: https://github.com/galadrielbriere/ClustOmics
منابع مشابه
Clustering Gene Expression Regulators: New Approach to Disease Subtyping
One of the main challenges in modern medicine is to stratify different patient groups in terms of underlying disease molecular mechanisms as to develop more personalized approach to therapy. Here we propose novel method for disease subtyping based on analysis of activated expression regulators on a sample-by-sample basis. Our approach relies on Sub-Network Enrichment Analysis algorithm (SNEA) w...
متن کاملEntropy-based Consensus for Distributed Data Clustering
The increasingly larger scale of available data and the more restrictive concerns on their privacy are some of the challenging aspects of data mining today. In this paper, Entropy-based Consensus on Cluster Centers (EC3) is introduced for clustering in distributed systems with a consideration for confidentiality of data; i.e. it is the negotiations among local cluster centers that are used in t...
متن کاملLecture 9 : Multi Omics Clustering December 19 , 2017
Omic is a term used to describe a field of study in biology that utilizes a certain type of biological data (e.g. genomics is the study of genome), while multi omics is the usage of several types of omics data. During the course we’ve mainly discussed mRNA data. In the introduction lecture other types of biological data were shown, protein and DNA data (referred to as proteomics and genomics). ...
متن کاملConsensus Clustering + Meta Clustering = Multiple Consensus Clustering
Consensus clustering and meta clustering are two important extensions of the classical clustering problem. Given a set of input clusterings of a given dataset, consensus clustering aims to find a single final clustering which is a better fit in some sense than the existing clusterings, and meta clustering aims to group similar input clusterings together so that users only need to examine a smal...
متن کاملUsing “omics” and integrated multi-omics approaches to guide
37 38 Running title: Using multi-omics for probiotic selection 39 40
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: BMC Bioinformatics
سال: 2021
ISSN: ['1471-2105']
DOI: https://doi.org/10.1186/s12859-021-04279-1